Optimal Tag Sets for Automatic Image Annotation
نویسندگان
چکیده
In this paper we introduce the Beam Search CRM (BS-CRM) model. This model implements two novel improvements to the basic CRM [2]. First, we argue that using a Minkowski kernel allows us to capture the covariance of visual features more effectively than the standard Gaussian kernel. Second, we advocate a procedure that selects the most informative subset of tags as the image annotation. Our procedure captures the mutual dependence within a set of tags, and naturally prevents noisy tags from being assigned during the search procedure. In automatic image annotation the basic objective is to find the set of tags w = {w1 . . .wk} that serves as the best annotation for the test image represented with a set of feature vectors f = {~f1. . .~fm}. The traditional approach used by [2] and many subsequent publications [3] [5] [4] involves estimating the marginal probability distribution over individual tags P(w|f) and annotating the image with top-ranked tags from that distribution. This approach however does not take into consideration any correlation between the tags: the top-ranked tags could be incohesive and contradictory, e.g. {tropical, blizzard, supernova}. Beam Search: To address both of the above issues, we propose to annotate images with the most informative subset of tags. We define the amount of information I(w) present in a set of tags w as the expected excess number of bits required to encode this set with the background model: I(w) = P(w|f) · log P(w|f) P0(w) .
منابع مشابه
Tags Re-ranking Using Multi-level Features in Automatic Image Annotation
Automatic image annotation is a process in which computer systems automatically assign the textual tags related with visual content to a query image. In most cases, inappropriate tags generated by the users as well as the images without any tags among the challenges available in this field have a negative effect on the query's result. In this paper, a new method is presented for automatic image...
متن کاملFuzzy Neighbor Voting for Automatic Image Annotation
With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
متن کاملScalable Image Annotation by Summarizing Training Samples into Labeled Prototypes
By increasing the number of images, it is essential to provide fast search methods and intelligent filtering of images. To handle images in large datasets, some relevant tags are assigned to each image to for describing its content. Automatic Image Annotation (AIA) aims to automatically assign a group of keywords to an image based on visual content of the image. AIA frameworks have two main sta...
متن کاملAutomatic Image Annotation Using Decision Trees and Rough Sets
The process which attaches label to a digital image by understanding the contents of image is termed as Automatic Image Annotation (AIA). Color and texture are the prominent features of a digital image. The content based image understanding is possible by using the feature strength of color and texture of an image. A classifier is designed using Decision Trees (DT) and Rough Sets (RS) to tag un...
متن کاملA bipartite graph model for associating images and text
The joint modeling of image and textual content is even more important now because of the the availability of large databases of image-rich web pages and the tagging phenomenon. Much of the current work focused on one-way association (image to text or tags). The association is often captured by building a model with hidden variables. In this paper, we propose a simple model based on random walk...
متن کامل